-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feat/pymupdf experiment #368
Conversation
@J08nY The datasets computed with Performance wise, the processing speed seems worse for
|
@dmacko232 Do we know what are the internal dependencies of Also, we're scanning some tables in FIPS documents with some java tool. Could we get rid of the java dependency as well? |
…not keys of line dictionary
…erts into feat/pymupdf_experiment
@adamjanovsky Poppler is not dependency. The java thing should not be dependency either I guess in case of pymupdf. |
Closing this. The details of what would it take to get pymupdf surpass pdftotext in terms of output quality are described in #364 |
This closes #364